First Report on the Genetic Diversity of Populations of Gossypium barbadense L. and Gossypium hirsutum L. in the Amazonian Native Communities, Cusco-Peru

The genus Gossypium has important ethnobotanical and economic value for Amazonian Native Communities (A.N.C.). However, little research has been undertaken on the distribution and genetic diversity of cotton populations maintained in the Peruvian rainforest. This work aims to present the first report on the genetic diversity of Gossypium spp. populations in the A.N.C. of the province of La Convención, Cusco-Peru. The methodology was based on exploring, collecting, identifying, and characterizing the Gossypium populations present in the A.N.C. Twenty-six descriptors were evaluated (9 quantitative and 17 qualitative), and with this information, distribution, correlation, and principal component (PC) analyses were carried out. As a result, plants of two species [G. barbadense L. (44 samples) and G. hirsutum L. (19 samples)], one variety [G. barbadense var. brasiliensis (75 samples)], and three previously unidentified variations (9 samples) were identified. Altogether, 147 samples were collected. G. barbadense var. brasiliensis, which was always found in association with other economic crops within an altitude range of 338 to 1086 m, was the most predominant (51%), distributed in eleven A.N.C. and always in small plots (up to 2 ha). G. barbadense L. was cultivated between 397 and 1137 m of altitude in eight A.N.C. in plots of up to 3 ha in marginal lands. G. hirsutum L., with a smaller distribution (13%), was found between 334 and 497 m of altitude in only three communities; this species is cultivated in marginal areas throughout the year. The variability found for the first two PCs when considering the quantitative and qualitative descriptors was high (74.7%) and moderate (48.2%), respectively. When combining all the descriptors, the analysis showed that the first two PCs accounted for 51.8% of the total variability of the data. The PCs of the two types of data and their combination confirmed that the three populations found were grouped. 
The nine undefined samples were close to or intermediate between the described ones, showing that these samples may be the result of spontaneous crosses; as such, these samples need to be better evaluated with other tools for further definition. The information obtained shows that in the A.N.C. of Cusco-Peru, there is variability conserved by the inhabitants, who have been able to maintain and use these genotypes, even from their Amazonian indigenous ancestry, and the environment has been able to generate variability among the species, as will be highlighted in future works.


Introduction
Peru has a high degree of ecological diversity of climates and species. As a result, information is still lacking on the biodiversity of Gossypium spp. in some Peruvian Amazonian communities.
Two species [G. barbadense L. (30%) and introduced populations of G. hirsutum L. (13%)] and one botanical variety [G. barbadense L. var. brasiliensis (51%)] were identified and characterized from a total of 147 samples collected in the two districts of La Convención Province, distributed in 12 A.N.C.
Among the collected samples, nine (6%) (Timpia community: five samples; Koribeni: one sample; and Poyentimari: three samples) merit further studies, such as molecular studies, to better describe and differentiate them from other species, because they showed different characteristics from the species identified in the other communities.
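The percentages quoted above follow directly from the sample counts reported in the text. The short check below recomputes them; the dictionary keys and variable names are illustrative labels, not identifiers used by the authors.

```python
# Sample counts per group as reported in the text (147 samples in total).
counts = {
    "G. barbadense L.": 44,
    "G. barbadense L. var. brasiliensis": 75,
    "G. hirsutum L.": 19,
    "unidentified (G1-G3)": 9,
}

total = sum(counts.values())

# Share of each group, rounded to the nearest whole percent as in the text.
shares = {group: round(100 * n / total) for group, n in counts.items()}

for group, pct in shares.items():
    print(f"{group}: {counts[group]} of {total} samples, about {pct}%")
```

The rounded shares are 30%, 51%, 13%, and 6%, which matches the figures given for the two species, the botanical variety, and the unidentified samples.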

Distribution of G. barbadense L.
The native varieties of the species G. barbadense L. were distributed in eight Amazonian native communities in the districts of Echarati and Megantoni, from 334 m to 1137 m altitude (Table 1). In the Echarati district, 25 samples (57%) were collected, while in the Megantoni district, 19 samples (43%) were collected for this species.
When evaluating the characteristics of each sample, it was observed that for the samples of G. barbadense L., the flower was, on average, approximately 7.25 cm long and 3.89 cm wide, and the color of the petals was yellow to narcissus yellow, even cream colored, with the presence of maculae ranging from shades of red to intense purple. The intensity of the spots varied with the size of the macula; the larger the spot, the more intense the color (Figure 1A). In some native Amazonian communities, some flowers without maculae were observed, but with characteristics of G. barbadense L. (Figure 1C).
G. barbadense L. is distributed in three different Amazonian communities in Echarati and five in Megantoni (Table 1). Its distribution is always in places with regular slopes and is associated with crops such as cassava (Manihot esculenta Crantz), corn (Zea mays), orange (Citrus sinensis (L.) Osbeck), bananas (Musa spp.) and beans (Phaseolus vulgaris L.). It can be observed on the borders and roadsides and always near the Vilcanota River in the district of Echarati. In these areas, there was evidence of widespread fruiting and adaptation to temperatures ranging from warm to hot, considering the altitude at which the samples were collected (Table 1). In the district of Megantoni, these plants are cultivated in parcels ranging from one to three hectares. They are also found on the edges of farms, on the edges of irrigation ditches and rural roads, and in the gardens of houses. Clay soils are predominant in this region. This species is part of the flora of the banks of the Urubamba River; its presence in the area is due to ancestral ethnobotanical and medicinal practices, as well as to its use in the manufacture of clothing typical of Andean culture.

Distribution of G. barbadense L. var. brasiliensis
G. barbadense L. var. brasiliensis was the most frequently encountered native variety, with 75 samples collected (Table 1). The variety is distributed between 336 and 1137 m.a.s.l. in the districts of Echarati and Megantoni and is present in 11 native Amazonian communities, as shown in Table 1. The flowers are approximately 7.27 cm long and 4.89 cm wide on average, and the color of the petals ranges from yellow to cream (Figure 1E), with the presence of red to purple maculae.
This botanical variety was observed in three communities in the district of Echarati (Table 1), distributed in small cotton plots and always associated with cassava (Manihot esculenta Crantz), papaya (Carica papaya L.), plantain (Musa spp.) and orange (Citrus sinensis (L.) Osbeck) crops. The soils where the samples of this variety were collected have a high content of organic matter and are always on a regular slope. This variety may be observed in front of some houses and as part of ornamental gardens alongside roads. In the district of Megantoni, it was observed in eight Amazonian native communities (Table 1), distributed in parcels of 1 to 2 ha as a monoculture on flat land with a clay soil texture and few nutrient reserves.

Distribution of G. hirsutum L.
G. hirsutum L. has also been described in the Province of La Convención, located between 364 and 506 m.a.s.l., in the districts of Echarati and Megantoni. The species was found in three native Amazonian communities (Nuevo Mundo, Monte Carmelo, and Poyentimari), as shown in Table 1. It was the least widely distributed: only 19 samples were found (Table 1), whereas for G. barbadense L. var. brasiliensis and G. barbadense L., a total of 75 and 44 samples were found, respectively.
When characterizing the flower of this species, an average length and width of approximately 7.88 cm and 4.45 cm were observed, with yellow to cream-colored petals, without the presence of red to purple maculae (Figure 1G). Seeds are loosely arranged and completely covered by olive green linters (Figure 1H), varying from 7 to 9 seeds, ovoid to oval in shape, and mostly dark brown. The fibers are smoke white (Figure 1H).
This species was introduced and is distributed in Echarati, where it can be observed in two native Amazonian communities (Table 1). Its distribution is subspontaneous on hillsides and in small cotton plots. It is always associated with crops such as cassava (Manihot esculenta Crantz), papaya (Carica papaya L.), plantain (Musa spp.) and orange (Citrus sinensis (L.) Osbeck). The soils where this species was located had a high organic matter content and were always on a regular slope. Similar to the other described species, it was also observed in the front of some houses or alongside roads as an ornamental plant. In the district of Megantoni, these plants may be found in parcels of less than 1 ha, where they are associated with the cultivation of Zea mays on flat ground with a poor clay soil texture near the banks of the lower Urubamba River.

Unidentified Samples of Gossypium spp.
During the characterization process, the collected samples were classified according to the qualitative characteristics of species described in the literature. As shown in Figure 2, within this process, three groups of samples stood out, as they shared the characteristics of more than one species and, at the same time, differed from the two identified species (Table 1).
Among the three groups of unknown samples, one sample was identified in the Amazonian native community of Poyentimari (G1) in the district of Echarati (Figure 2A,D). A defining characteristic of this sample was that the seeds were completely covered with olive green linter; they were distinct in shape, and the color of their fiber was white with a creamy hue (Figure 2D). The flowers averaged approximately 7.72 cm in length and 4.61 cm in width, and the petal color ranged from pale yellow to whitish cream, with purple maculae present (Figure 2A).
A second sample (Figure 2B,E), found in the Koribeni Native Community (G2), was identified as being distinct from the two species described in the sampled areas. The flower of this sample had purple maculae, and green linter covered the entire seed (Figure 2E). The flowers were 8.5 cm long by 3.48 cm wide (Figure 2B). The plants were associated with cacao (Theobroma cacao L.), masasamba (Annona muricata L.), and other medicinal plants. The villagers commented that this species has been in cultivation for more than 10 years; it was only found in one area (Table 1).
Figure 2C,F shows the particular characteristics of the third group of cotton samples found in the Timpia community (G3) but not in the Megantoni district (Table 1). The flower had purple maculae, measuring 6.90 cm in length by 5.12 cm in width (Figure 2C). The seeds were kidney-shaped and had olive green linters throughout (Figure 2F). This plant was observed on the banks of the lower Urubamba in a monoculture plot in areas larger than 1 ha. The villagers commented that this species has been in cultivation for more than 8 years; it was only found in one area (Table 1).

Quantitative Characterization of the Collected Samples
The results of the analysis of variance performed on the quantitative descriptors obtained when comparing species of Gossypium spp. collected in the province of La Convención are shown in Table 2. There were highly significant differences (p < 0.001) between all the species described for all the descriptors used. Low coefficients of variation (<6%) showed the high level of precision of the data obtained when evaluating these field-collected samples. Comparison of the mean values of the nine quantitative descriptors revealed phenotypic diversity, since each descriptor showed significant differences among groups according to Tukey's test (Table 3). For flower length (FL), bract length (BL), leaf length (LL), and leaf width (LW), the highest (8.5 cm, 6.94 cm, 25.48 cm, and 29.8 cm, respectively) and lowest (6.9 cm, 5.74 cm, 12.74 cm, and 15.54 cm, respectively) values corresponded to the unknown samples of groups G2 and G3, respectively (Table 3). For flower width (FW) and CL, the ranking was inverted: the highest values were observed in the unknown samples of the G3 group (5.12 cm and 7.30 cm, respectively) and the lowest in the G2 group (3.48 cm and 5.84 cm, respectively) (Table 3). This finding revealed the potential of these unidentified groups for these descriptors. For the BW descriptor, the highest and lowest values were obtained for the unknown samples of groups G1 (6.98 cm) and G3 (3.88 cm) (Table 3). For HP, samples from the unknown groups G2 (3.58 m) and G1 (3.11 m) stood out, with the highest and lowest values for this descriptor, respectively. These data allowed us to conclude that the differentiation among the described species was evident, possibly indicating new genotypes and the potential of practicing selection in these groups.
For the CW descriptor, the highest values were obtained for the species G. hirsutum L. (3.4 cm) and the samples of the unknown group G2 (3.56 cm), which did not differ statistically between them (Table 3). For this same descriptor, the lowest values were obtained for the samples of the unknown group G3 (2.74 cm) and G. barbadense L. var. brasiliensis (2.83 cm), with no differences between them.
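To make the two summary statistics mentioned above concrete, the following is a minimal sketch of a one-way analysis of variance for a single descriptor, together with the coefficient of variation derived from the residual mean square. The measurements are invented for illustration and are not the study's data; the function name `one_way_anova` and the group labels are hypothetical. The sketch does not compute p-values or Tukey's test, which would normally come from a statistics package.

```python
from statistics import mean


def one_way_anova(groups):
    """Return (F statistic, CV in percent) for {group name: measurements}."""
    values = [x for g in groups.values() for x in g]
    grand_mean = mean(values)
    k, n = len(groups), len(values)

    # Variation of the group means around the grand mean.
    ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2
                     for g in groups.values())
    # Variation of the observations around their own group mean.
    ss_within = sum((x - mean(g)) ** 2
                    for g in groups.values() for x in g)

    ms_between = ss_between / (k - 1)
    ms_within = ss_within / (n - k)

    f_stat = ms_between / ms_within
    # Experimental CV: residual standard deviation relative to the grand mean.
    cv_percent = 100 * ms_within ** 0.5 / grand_mean
    return f_stat, cv_percent


# Hypothetical flower-length measurements (cm), invented for illustration.
flower_length = {
    "group_a": [7.2, 7.3, 7.1, 7.4],
    "group_b": [7.9, 7.8, 8.0, 7.7],
    "group_c": [6.9, 7.0, 6.8, 6.9],
}

f_stat, cv_percent = one_way_anova(flower_length)
print(f"F = {f_stat:.2f}, CV = {cv_percent:.2f}%")
```

A large F value together with a small CV is the pattern the text describes: the groups differ clearly, while the measurements within each group are consistent.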

Correlations of the Quantitative Descriptors Obtained among the Groups Derived from the Collected Samples
Pearson correlations were established among the nine quantitative descriptors when considering five groups of cotton (G. barbadense L., G. barbadense L. var. brasiliensis, G. hirsutum L., and the unknown groups G1 and G3) collected in the province of La Convención, in the districts of Echarati and Megantoni, Cusco-Peru (Figure 3). The data show that there were strong correlations (>70%) between the descriptors FL × BL (83.0%, p < 0.001), FL × BW (81.7%, p < 0.001), FL × LL (77.8%, p < 0.001), FW × LW (−72.5%, p < 0.001), BL × BW (77.9%, p < 0.001), BW × LL (84.1%, p < 0.001), CW × LL (78.4.7%, p < 0.001), CW × LW (75.8%, p < 0.001), and LL × LW (84.1%, p < 0.001) (Figure 3). The lowest correlations were observed between FL × FW (0.004%), BW × FW (0.078%), HP × CL (0.07%), and LW × CL (0.04%), none of which were significant (Figure 3). Figure 3 also shows the dispersion of the data for each of the groups in the combinations of descriptors (below the main diagonal) and the distribution of the data (main diagonal). The first column of Figure 3 shows the distribution of classes within the five groups evaluated in relation to each descriptor. This column shows that for most of the quantitative descriptors, the population of G. barbadense L. var. brasiliensis showed the highest number of classes associated with the existing variation within the five groups evaluated according to each descriptor (Figure 3). In the first row of Figure 3, the means are also represented using box plots, showing that the black group (unknown group G1) had the highest mean values for most of the descriptors, as described in Table 3, revealing the superiority of this group compared to the two described species and the other unknown group G3. Unknown group G2 was not included in this analysis because it contained only one sample.
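As a reminder of what the correlation values above measure, here is a minimal sketch of the Pearson coefficient for one pair of descriptors, expressed as a percentage in the same way as in the text. The leaf measurements are invented for illustration and are not taken from the study; `pearson_r` is a hypothetical helper name.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    # Sum of cross-products and sums of squares of the deviations.
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)


# Hypothetical leaf length and leaf width values (cm), for illustration only.
leaf_length = [14.1, 15.0, 18.2, 17.5, 21.3, 12.9]
leaf_width = [16.0, 17.4, 21.1, 20.2, 25.6, 15.5]

r = pearson_r(leaf_length, leaf_width)
print(f"LL x LW correlation: {100 * r:.1f}%")
```

Values close to 100% or to −100% indicate descriptors that vary together or in opposite directions, while values near 0% indicate no linear relationship.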

Qualitative Characterization of the Collected Samples
Qualitative characterizations of the samples collected in the field were carried out by evaluating 16 descriptors for the cotton crop; the different classes are described in Table 4. Among the descriptors evaluated, the CE associated with the cotton speck showed the greatest diversity, as 18 different colors were found within it (Table 4). For this descriptor, G. barbadense L. and G. barbadense L. var. brasiliensis showed 10 and 16 colors, respectively, as a sample of the wide diversity of colors that can be found for these species in Peruvian Amazonian communities (Figure 4C). The species G. hirsutum L. showed only two colors [smoke white (Figure 4) and white (Figure 4D)], showing low variability for this descriptor. Each of the three unknown groups (nine samples in total) showed a single color, ranging from creamy white in G1 (Figure 4S) and white in G2 (Figure 4D) to brownish in G3 (Figure 4R). Figure 4 shows a sample of the 18 colors found in this study as a representation of the diversity found for this descriptor in Peruvian Amazonian communities.
The descriptor with the second highest number of classes was CB with 6, followed by CSF and CF with 5; NS, CS, DB, CC, and PPL with 4; SS and SLSC with 3; and PSL, PSA, DS, PTC and SL with only 2 classes (Table 4). For the species G. barbadense L. and G. hirsutum L., as well as the variety G. barbadense L. var. brasiliensis, variability was observed for most of the qualitative descriptors, except for DS, CS, CC, SL, and PPL, which showed only a single class within each of the groups (Table 4). For the three unidentified groups, a single class per descriptor was more common, except for the descriptors DB, SLSC, and PTC, which showed two classes in group G1 (Table 4).

Principal Component Analysis (PCA) Using the Qualitative and Quantitative Descriptors
Data from the nine quantitative and 16 qualitative descriptors obtained from characterizing 147 cotton samples collected in Amazonian communities of Peru were used to perform a principal component analysis; the results are shown in Figure 5. When considering the data from the quantitative descriptors (Figure 5A), a high level of variation was observed for the first two components, which together described 74.7% of the variability of the data. For this data set, the a priori characterized samples (groups and samples of unknown origin) were grouped together, mainly for the samples belonging to G. barbadense L. (blue), G. barbadense L. var. brasiliensis (red) and G. hirsutum L. (green) (Figure 5A). The group formed by sample G2 (brown) was distant from those of the described species, showing that this sample was divergent for these descriptors and deserves attention in future studies. The samples of group G1 (black) were located between the groups formed by G. barbadense L. var. brasiliensis (red) and G. hirsutum L. (green), possibly evidencing crosses between the two (Figure 5A). The samples of group G3 (yellow) were located close to the group comprising G. barbadense L. var. brasiliensis (red); at the same time, this group was closely associated with the descriptor FW, which, among all of the descriptors, was the one with the highest contribution, followed by CL and LW (Figure 5A).
When considering the 16 qualitative descriptors, the PCA showed that the first two components retained 48.2% of the total variation in the data (Figure 5B). The analysis showed that the groups formed by the species G. barbadense L. (blue) and G. hirsutum L. (green), as well as the variety G. barbadense L. var. brasiliensis (red), were distant from each other, indicating that these characteristics determined the differences between these three cotton groups (Figure 5B). The species G. barbadense L. (blue) showed a strong association with the descriptors DS, PSA, SLSC, and PTC due to the proximity, magnitude, and direction of the arrows representing each descriptor. The variety G. barbadense L. var. brasiliensis (red) strongly correlated with the descriptors CS, PSL, CSF, CE, and NS (Figure 5B). The species G. hirsutum L. (green) was slightly more distant from the other two groups described above and was only associated with the descriptor SL (Figure 5B). For this set of qualitative descriptors, the unknown samples of groups G2 and G3 were located close to G. hirsutum L. (green) and G. barbadense L. var. brasiliensis (red) (Figure 5B). On the other hand, the unknown sample of group G1 (black) was located close to the members of G. barbadense L. var. brasiliensis (red) and G. barbadense L. (blue). The location of these unknown groups (G1, G2, and G3), as well as that shown with the quantitative data (Figure 5A), provided evidence that these groups are the product of the hybridization of the described species. However, further studies are required to confirm this.
When we combined both types of data (qualitative and quantitative) in the PCA, we found that 51.8% of the variability of the data was retained in the first two components (Figure 6). As observed in Figure 5, the groups corresponding to the described taxa [G. barbadense L. (blue), G. barbadense L. var. brasiliensis (red) and G. hirsutum L. (green)] and to the unknown samples (G1, G2, and G3) maintained their groupings and the distances between them (Figure 6). For the unknown groups, G1 was located between G. barbadense L. var. brasiliensis (red) and G. hirsutum L. (green), G2 was located close to G. hirsutum L. (green), and G3 was located close to the group formed by the samples of G. barbadense L. var. brasiliensis (red) (Figure 6).
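The percentages of variability retained by the first two components (74.7%, 48.2%, and 51.8%) are ratios of eigenvalues to total variance. The sketch below shows one way such a figure can be obtained, assuming a PCA on standardized descriptors (that is, on the correlation matrix), which the text does not state explicitly. The data matrix is invented for illustration, and the helper names are hypothetical. Qualitative descriptors would first need a numeric coding, and how the authors coded them is not described here. The eigenvalues are found by simple power iteration with deflation, which is adequate for a small symmetric matrix but is not what a statistics package would use.

```python
from math import sqrt


def standardize(rows):
    """Center each column and scale it to unit sample standard deviation."""
    n = len(rows)
    columns = list(zip(*rows))
    means = [sum(col) / n for col in columns]
    sds = [sqrt(sum((v - m) ** 2 for v in col) / (n - 1))
           for col, m in zip(columns, means)]
    return [[(v - m) / s for v, m, s in zip(row, means, sds)] for row in rows]


def correlation_matrix(z_rows):
    """Correlation matrix computed from already standardized rows."""
    n, p = len(z_rows), len(z_rows[0])
    return [[sum(r[i] * r[j] for r in z_rows) / (n - 1) for j in range(p)]
            for i in range(p)]


def top_eigenvalues(matrix, k, iterations=500):
    """Largest k eigenvalues of a symmetric positive semi-definite matrix."""
    m = [row[:] for row in matrix]
    p = len(m)
    eigenvalues = []
    for _ in range(k):
        v = [1.0 + 0.1 * i for i in range(p)]  # arbitrary start vector
        for _ in range(iterations):
            w = [sum(m[i][j] * v[j] for j in range(p)) for i in range(p)]
            norm = sqrt(sum(x * x for x in w))
            if norm == 0:
                break
            v = [x / norm for x in w]
        # Rayleigh quotient of the converged unit vector.
        lam = sum(v[i] * sum(m[i][j] * v[j] for j in range(p))
                  for i in range(p))
        eigenvalues.append(lam)
        # Deflation: remove the component just found.
        m = [[m[i][j] - lam * v[i] * v[j] for j in range(p)]
             for i in range(p)]
    return eigenvalues


def explained_variance(rows, k=2):
    """Fraction of total variance carried by each of the first k components."""
    corr = correlation_matrix(standardize(rows))
    trace = sum(corr[i][i] for i in range(len(corr)))
    return [lam / trace for lam in top_eigenvalues(corr, k)]


# Hypothetical samples x descriptors matrix (cm), for illustration only.
samples = [
    [7.2, 3.9, 14.1],
    [7.3, 4.8, 15.0],
    [7.9, 4.4, 18.2],
    [7.7, 4.6, 17.5],
    [8.5, 3.5, 25.0],
    [6.9, 5.1, 12.9],
]

fractions = explained_variance(samples, k=2)
print(f"PC1 + PC2 retain {100 * sum(fractions):.1f}% of the variability")
```

Because the correlation matrix has ones on its diagonal, its trace equals the number of descriptors, so each eigenvalue divided by the trace gives the share of variability attributed to that component.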

Discussion
Currently, there are difficulties in identifying and determining the exact distribution of the cotton populations of the Gossypium species, which prevents clearer information about this genus from being made available [6]. There are reviews on the classification of this genus [2,3,24], but there are gaps in the information on its distribution, and some zones remain unknown, for example, in the Peruvian Amazon, making it difficult to adequately delimit the species. With this in mind, our work shows the distribution of the cotton species found and characterized after visiting two districts (Echarati and Megantoni) in the Province of La Convención, where cotton plants are conserved and distributed in 12 A.N.C. From this fieldwork, 147 samples were collected, revealing the presence of two species [G. barbadense L. (30%) and populations of the introduced species G. hirsutum L. (13%)] and one botanical variety [G. barbadense L. var. brasiliensis (51%)]. In the remaining 6% of the samples collected [communities of Timpia (5 samples), Koribeni (1 sample) and Poyentimari (3 samples)], the evaluated characteristics did not allow us to identify the samples at the species level. This is the first report of the presence of G. hirsutum L. in the Peruvian Amazon. The conservation and distribution levels found for G. barbadense L. and G. hirsutum L. showed the economic importance they represent for Amazonian communities, owing to the quality of their fibers, as already described by Ozyigit [25] and Ahmed et al. [26].
The genus Gossypium comprises more than 50 recognized species distributed in arid and semiarid zones in the tropics and subtropics [27]. Four species were domesticated independently to take advantage of their fibers: G. arboreum, G. herbaceum, G. hirsutum, and G. barbadense [2,27]. The description provided by MINAM [2] did not report the presence of G. hirsutum L. in the region of Cusco; however, this species can be found in the regions of Cajamarca, Lambayeque, and Pasco. On the other hand, it is known that G. hirsutum L. is an introduced and cultivated species [28]; its origin is Mexico [9], and its fiber is very important for the countries that cultivate it [10,21]. The accounts of the natives living in these localities confirmed the use of this species and showed the importance of these cotton species among indigenous communities. The information obtained in this work shows that there were only incomplete data for the districts of the Cusco region, which underlines the importance of the present study.
G. barbadense L. has been reported in 24 regions in Peru [2]. This species, which is thought to originate from northwestern South America, has wild forms and a high level of diversity; hence, the hypothesis is that this species originated in northern Peru and southern Ecuador [6,29]. The importance of this species lies in the quality of its fiber and its improved varieties, destined for spinning and weaving for export and use by native communities [6]. However, Lazo [6] and MINAM [2] reported that this species arrived in the jungle by dispersal in the Amazon, a process that occurred from domesticated forms, probably from what is now Peru, Ecuador, and Colombia, via river networks between Colombia and Peru. This species was found in 8 of the 12 sampled communities and, together with its variety brasiliensis, made up 81% of the samples collected. It showed wide diversity for the quantitative (Table 3) and qualitative (Table 4) characteristics of the species in these regions.
According to Lazo [24], G. barbadense L. var. brasiliensis is of the Amazonian type, with an undetermined origin. It has been noted that it may have spread through the Apure River to the northwest of Colombia, being of interest for the study of the west and east of the Andes. This variety within G. barbadense L. was the most prevalent (51% of the collected samples) in the districts of Echarati and Megantoni. This result agrees with the studies of Lazo [24], who stated that it was in these same districts that this variety has spread the most since its establishment in Amazonia. This variety has managed to adapt to humid tropical conditions and is found in Argentina, Bolivia, Paraguay, northeastern Brazil, the Guianas, and northern South America [2,6,24]. MINAM [30] noted that G. barbadense L. var. brasiliensis has peculiar characteristics, i.e., its seeds are fused or kidney-shaped, as also observed in our research (Figure 2F). Comments from the inhabitants of the native Amazonian communities indicated that this cotton is the one they wish to cultivate since, at the time of spinning, it is easier to remove, unlike G. hirsutum L. and G. barbadense L., because its seeds are dispersed in its specks and the fiber is relatively easy to extract. This variety is important in the Amazonian communities that cultivate it, constituting a means of production of typical clothing and handicrafts, and therefore, an important source of income [31].
Currently, there is concern because there is evidence that imported yarn is being introduced from other countries, which means that villagers no longer need to grow cotton. This has led to a loss of customs and of interest in planting, although both are still preserved at present, as indicated by our research (Table 1). Our study also verified the distribution of these species (94% of the collected samples). For the first time, we report the emergence of new genotypes, represented by the three groups of unidentified plants among the collected samples (6% of the collected samples). Although these three groups were close to the described species, the PC analyses suggest that they may be the result of natural hybridization between species (Figures 5 and 6), as they share specific characteristics with one another while diverging from the described species. This finding suggests that the diversity of the species, far from being lost, is increasing. Although local communities already use them, these unidentified groups need to be studied [2,6,24,30], as they have very good fiber quality while differing from the species that have been fully identified. Such studies will help reduce the scientific information gap for both the described species and the new groups described in this research.
The Cusco region is very popular with tourists, who come to Machu Picchu and stay in the province of Urubamba, which borders the province of La Convención. One of the attractions for tourists is the handicrafts and clothing that the communities produce using the variety of colors that these species provide; our study described 18 different colors (Table 4 and Figure 4). It is speculated that seeds may have been brought by tourists visiting these sites. In addition, inhabitants of the different Amazonian native communities of Echarati and Megantoni stated in conversations that many seeds were brought from other departments to improve the quality of the cotton fibers. Others were collected from the wild to select and improve their fiber, without knowing which species were being introduced or the alterations that could arise from hybridization among these species [25,26]. These accounts may explain why 6% of the samples did not belong to the two species that our study and previous works have reported for this region [25,26].
As a result of the present work, we determined the distribution of two species and one variety of cotton and provided a description of three groups of unidentified samples found in different native Amazonian communities, where hybridization or speciation may have already occurred. For these three groups, described for the first time here, the application of 9 quantitative and 17 qualitative descriptors, combined with principal component analysis, allowed us to describe their characteristics in relation to the described species. The settlers of the different Amazonian native communities stated that these new groups have already been present for between 8 and 10 years, although this will need to be verified in later studies. This study lays the foundation for future work using molecular tools to clarify the origins of these plants and properly place them within the genus Gossypium. With this information, we confirm that the diversity of the species in the characterized regions is relevant and that the in situ maintenance and conservation carried out by the Amazonian communities is important, including for investigations such as those proposed in this work, which will allow us to elucidate the distribution of the genus Gossypium in this region of Peru.

The collections and evaluations were conducted from April to December 2021. Permission for the collection was requested from the native Amazonian communities. Samples of cotton (Gossypium spp.) were collected in 12 native Amazonian communities in the districts of Echarati and Megantoni. The type of sampling was nonprobabilistic purposive, by convenience [32]. Within each community, collections were carried out in all places where Gossypium spp. was cultivated or found naturally, according to the collection and pressing procedures recommended by MINAM [3].

Qualitative Descriptors Evaluated
Once the samples had been collected, characterization at the species level was carried out according to the following parameters.
In the plants, the following descriptors were evaluated for SPECK: the color of the speck (cotton balls); for FLOWERS: the presence of spots, color of the flower spots, flower color, and position of the stigma in relation to the anthers; for SEEDS: arrangement, number, main characteristics, color, and shape; for BRACTS: types of serration and color; for CAPSULES: shape in longitudinal section, characteristics, and prominence of the tip; and finally, for LEAF: shape and presence of pubescence. All these descriptors were evaluated following the recommendations of Manco Céspedes et al. [33] and MINAM [2].
All the descriptors were tabulated, and different classes were established for the different species, which allowed us to undertake a descriptive study of the collection sites and the distribution of the main cotton species collected in each locality.
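The tabulation of qualitative descriptors into classes can be illustrated with a minimal Python sketch. It is not the procedure used in this study; the taxon labels, descriptor names, and classes below are hypothetical placeholders chosen only to show how class frequencies per group could be counted.

```python
from collections import Counter, defaultdict

# Hypothetical observations: (taxon label, descriptor, observed class).
# Labels and classes are placeholders, not values from the study.
observations = [
    ("taxon_a", "fiber_color", "white"),
    ("taxon_a", "fiber_color", "white"),
    ("taxon_a", "fiber_color", "brown"),
    ("taxon_a", "seed_arrangement", "fused"),
    ("taxon_b", "fiber_color", "cream"),
    ("taxon_b", "seed_arrangement", "free"),
    ("taxon_b", "seed_arrangement", "free"),
]


def tabulate(rows):
    """Count how often each class occurs per (taxon, descriptor) pair."""
    table = defaultdict(Counter)
    for taxon, descriptor, observed_class in rows:
        table[(taxon, descriptor)][observed_class] += 1
    return table


def relative_frequencies(counts):
    """Convert class counts into proportions that sum to 1."""
    total = sum(counts.values())
    return {cls: n / total for cls, n in counts.items()}


table = tabulate(observations)
for (taxon, descriptor), counts in sorted(table.items()):
    print(taxon, descriptor, relative_frequencies(counts))
```

Keeping the counts keyed by (taxon, descriptor) makes it straightforward to compare the distribution of classes between groups or between collection sites.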

Quantitative Descriptors Assessed
The length and width of the flowers, bracts, capsules, and leaves were determined for all samples. All these measurements were made in cm using a ruler. The height of the plants was also determined in meters with the aid of a ruler. The evaluations followed the recommendations of Manco Céspedes et al. [33] and MINAM [2]. The information generated was tabulated and used in subsequent analyses.

Statistical Analysis
To verify the relationship between the quantitative descriptors and the groups of species, ANOVA was initially performed considering the unbalanced treatments (species), and when the result was significant, the means were compared by Tukey's test at the 1% probability level. Using the quantitative data, a scatter plot matrix was produced showing the Pearson correlations and the dispersion of each characteristic. The quantitative and qualitative data were then subjected to a multivariate analysis using the PCA method, applied to each dataset separately and to both combined, to examine the overall variability among the groups formed as well as the trends of the groupings and descriptors. Rbio software [34] and SigmaPlot 10.0® (Systat Software Inc.) were used to prepare the graphs, and R [35] with the "GGally" package was used to generate the correlation plot.
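As an illustration of the first two steps described above, the following Python sketch computes a one-way ANOVA F statistic for groups of unequal size and a Pearson correlation coefficient. It is a sketch under stated assumptions, not the analysis code of this study (which used Rbio and R): the function names and all measurements are hypothetical, and the p-value lookup, Tukey's test, and PCA are deliberately left out.

```python
from math import sqrt
from statistics import mean


def one_way_anova(groups):
    """Return (F, df_between, df_within) for a one-way ANOVA.

    `groups` maps a group label to a list of measurements.
    Group sizes may differ (unbalanced design).
    """
    all_values = [x for values in groups.values() for x in values]
    grand_mean = mean(all_values)
    # Each group mean is weighted by its own sample size, which is
    # what keeps the decomposition valid for unbalanced groups.
    ss_between = sum(
        len(v) * (mean(v) - grand_mean) ** 2 for v in groups.values()
    )
    ss_within = sum(
        (x - mean(v)) ** 2 for v in groups.values() for x in v
    )
    df_between = len(groups) - 1
    df_within = len(all_values) - len(groups)
    f_stat = (ss_between / df_between) / (ss_within / df_within)
    return f_stat, df_between, df_within


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length lists."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sqrt(sum((x - mx) ** 2 for x in xs))
    sy = sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)


# Hypothetical leaf lengths (cm) for three groups of unequal size.
leaf_length = {
    "group_a": [10.0, 12.0, 11.0],
    "group_b": [14.0, 15.0],
    "group_c": [9.0, 8.0, 10.0, 9.0],
}
f_stat, df_b, df_w = one_way_anova(leaf_length)
print(f"F({df_b}, {df_w}) = {f_stat:.2f}")

# Hypothetical paired measurements (leaf length vs. leaf width, cm).
lengths = [10.0, 12.0, 11.0, 14.0, 15.0]
widths = [8.0, 9.5, 9.0, 11.0, 12.5]
print(f"r = {pearson_r(lengths, widths):.3f}")
```

The F statistic would then be compared against the F distribution with the reported degrees of freedom; only when it is significant would a post hoc comparison of means such as Tukey's test be applied.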

Conclusions
The present study allowed us to identify two species of Gossypium (G. barbadense L. and G. hirsutum L.) and one botanical variety, G. barbadense L. var. brasiliensis, in the districts of Echarati and Megantoni, along with three groups of samples that could not be identified using the qualitative and quantitative descriptors evaluated. This report adds to the information available for this region of the Peruvian Amazon, verifying that there is phenotypic variability among Gossypium spp. and that conservation efforts are being made by the indigenous people who inhabit these places. The multivariate analysis tools and the evaluation of different descriptors allowed us to affirm that variability exists within the described species and that the new groups identified may be the result of spontaneous crosses between the described species of Gossypium; further studies will be necessary to confirm this hypothesis. The results also show the existence of G. hirsutum L. in this zone, and the widest distribution was observed for G. barbadense L.